Overdispersed Generalized Linear Models

نویسندگان

  • Dipak K. Dey
  • Alan E. Gelfand
  • Fengchun Peng
چکیده

Generalized linear models have become a standard class of models for data analysts. However in some applications, heterogeneity in samples is too great to be explained by the simple variance function implicit in such models. Utilizing a two parameter exponential family which is overdispersed relative to a speciied one parameter exponential family enables the creation of classes of overdispersed generalized linear models (OGLM's) which are analytically attractive. We propose tting such models within a Bayesian framework employing noninformative priors in order to let the data drive the inference. Hence our analysis approximates likelihood-based inference but with possibly more reliable estimates of variability for small sample sizes. Bayesian calculations are carried out using a Metropolis-within-Gibbs sampling algorithm. An illustrative example using a data set involving damage incidents to cargo ships is presented. Details of the data analysis are provided including comparison with the standard generalized linear models analysis. Several diagnostic tools reveal the improved performance of the OGLM.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Approaches for Overdispersion in Generalized Linear Models

Generalized linear models (GLM's) have been routinely used in statistical data analysis. The evolution of these models as well as details regarding model tting, model checking and inference is thoroughly documented in McCullagh and Nelder (1989). However, in many applications, heterogeneity in the observed samples is too large to be explained by the simple variance function which is implicit in...

متن کامل

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

In analyzing complicated data, we are often unwilling or not confident to impose a parametric model for the data-generating structure. One important example is data analysis for proportional or count data with overdispersion. The obvious advantage of assuming full parametric models is that one can resort to likelihood analyses, for instance, to use AIC or BIC to choose the most appropriate regr...

متن کامل

On the EM algorithm for overdispersed count data.

In this paper, we consider the use of the EM algorithm for the fitting of distributions by maximum likelihood to overdispersed count data. In the course of this, we also provide a review of various approaches that have been proposed for the analysis of such data. As the Poisson and binomial regression models, which are often adopted in the first instance for these analyses, are particular examp...

متن کامل

On Hinde-Demetrio Regression Models for Overdispersed Count Data

In this paper we introduce the Hinde-Demétrio (HD) regression models for analyzing overdispersed count data and, mainly, investigate the e¤ect of dispersion parameter. The HD distributions are discrete additive exponential dispersion models (depending on canonical and dispersion parameters) with a third real index parameter p and have been characterized by its unit variance function + p. For p ...

متن کامل

Bootstrap Model Selection in Generalized Linear Models

Model selection is a central component of data analysis Though there are a variety of methods for likelihood based estimation methods there are relatively few for non likelihood based generalized linear models GLM such as in the quasi likelihood and generalized es timating equation GEE approaches In this paper we develop basic and bias corrected bootstrap approaches to estimate the predictive m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997